Avoid parsing joint rule codes as distinct codes in `# noqa` #12809

charliermarsh · 2024-08-11T23:07:25Z

Summary

We should enable warnings for unsupported codes, but this at least fixes the parsing for # noqa: F401F841.

InSyncWithFoo · 2024-08-11T23:20:33Z

is_alphanumeric() also allows non-ASCII-digits and letters:

# noqa: A三

Did you mean to use is_ascii_alphanumeric() instead?

charliermarsh · 2024-08-11T23:22:57Z

Seems reasonable to exclude non-ASCII.

codspeed-hq · 2024-08-11T23:28:48Z

CodSpeed Performance Report

Merging #12809 will not alter performance

_{Comparing charlie/parse (c2e9746) with main (f837428)}

Summary

✅ 32 untouched benchmarks

InSyncWithFoo · 2024-08-11T23:31:14Z

On that same note, is_whitespace() and trim_start() / trim_end() also handle Unicode whitespace. They should be replaced with is_ascii_whitespace() and trim_ascii_start() / trim_ascii_end(), accordingly.

"ASCII whitespace" is defined as "space, tab, LF, FF, CR". In our case, since Python comments don't allow line breaks, that boils down to just spaces and tabs.

github-actions · 2024-08-11T23:36:50Z

`ruff-ecosystem` results

Linter (stable)

ℹ️ ecosystem check detected linter changes. (+3 -0 violations, +0 -0 fixes in 2 projects; 52 projects unchanged)

apache/airflow (+3 -0 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --no-preview --select ALL

+ providers/src/airflow/providers/apache/hdfs/sensors/hdfs.py:45:37: RUF100 [*] Unused `noqa` directive (unknown: `Ignore`)
+ providers/src/airflow/providers/apache/hdfs/sensors/hdfs.py:50:38: RUF100 [*] Unused `noqa` directive (unknown: `Ignore`)
+ providers/src/airflow/providers/google/cloud/hooks/bigquery.py:59:42: RUF100 [*] Unused `noqa` directive (unknown: `Used`)

pandas-dev/pandas (+0 -0 violations, +0 -0 fixes)

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
RUF100	3	3	0	0	0

Linter (preview)

ℹ️ ecosystem check detected linter changes. (+3 -0 violations, +0 -0 fixes in 1 projects; 53 projects unchanged)

apache/airflow (+3 -0 violations, +0 -0 fixes)

ruff check --no-cache --exit-zero --ignore RUF9 --output-format concise --preview --select ALL

+ providers/src/airflow/providers/apache/hdfs/sensors/hdfs.py:45:37: RUF100 [*] Unused `noqa` directive (unknown: `Ignore`)
+ providers/src/airflow/providers/apache/hdfs/sensors/hdfs.py:50:38: RUF100 [*] Unused `noqa` directive (unknown: `Ignore`)
+ providers/src/airflow/providers/google/cloud/hooks/bigquery.py:59:42: RUF100 [*] Unused `noqa` directive (unknown: `Used`)

Changes by rule (1 rules affected)

code	total	+ violation	- violation	+ fix	- fix
RUF100	3	3	0	0	0

charliermarsh · 2024-08-11T23:41:38Z

But Python comments can contain non-ASCII whitespace? We already have an is_python_whitespace that respects the spec for tokens: https://docs.python.org/3/reference/lexical_analysis.html#whitespace-between-tokens

InSyncWithFoo · 2024-08-11T23:57:29Z

But Python comments can contain non-ASCII whitespace?

Fair enough, though I still don't see why someone would want to use non-ASCII whitespace in the noqa part of a comment. A user might want to write an explanation in their native language in the same comment, sure, but noqa and the rule codes are never localized.

InSyncWithFoo · 2024-08-12T03:42:39Z

One more thing: ParsedFileExemption::try_extract() currently treats A001 , B002 (whitespace before comma) and A001,,B002 (empty slot) as if B002 doesn't exist. Is this difference in behaviour intended?

dhruvmanila · 2024-08-12T03:58:32Z

crates/ruff_linter/src/noqa.rs

+
+    #[test]
+    fn noqa_empty_comma() {
+        let source = "# noqa: F401,,F841";


I think we should show a warning in this case but still parse the remaining codes (if that's possible currently). This would be similar to how we recover from a missing element in list parsing: https://play.ruff.rs/308ab356-9355-43f3-99b7-12c7c2de7334.

charliermarsh added bug Something isn't working suppression Related to supression of violations e.g. noqa labels Aug 11, 2024

charliermarsh force-pushed the charlie/parse branch from 0584d32 to 149bb2e Compare August 11, 2024 23:10

charliermarsh force-pushed the charlie/parse branch from 149bb2e to 383676e Compare August 11, 2024 23:23

dhruvmanila reviewed Aug 12, 2024

View reviewed changes

dhruvmanila approved these changes Aug 12, 2024

View reviewed changes

charliermarsh force-pushed the charlie/parse branch 2 times, most recently from 464bb82 to 5894aa0 Compare November 2, 2024 20:14

Avoid parsing joint rule codes as distinct codes in # noqa

c2e9746

charliermarsh force-pushed the charlie/parse branch from 5894aa0 to c2e9746 Compare November 2, 2024 20:15

charliermarsh enabled auto-merge (squash) November 2, 2024 20:15

charliermarsh merged commit 35c6dfe into main Nov 2, 2024
18 checks passed

charliermarsh deleted the charlie/parse branch November 2, 2024 20:25

BrewTestBot mentioned this pull request Nov 8, 2024

ruff 0.7.3 Homebrew/homebrew-core#197081

Merged

BryceBeagle mentioned this pull request Nov 9, 2024

0.7.3 introduces false positives for invalid # noqa directives with RUF100 #14228

Closed

InSyncWithFoo mentioned this pull request Nov 14, 2024

[ruff] Unformatted special comments (RUF037) #14111

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid parsing joint rule codes as distinct codes in `# noqa` #12809

Avoid parsing joint rule codes as distinct codes in `# noqa` #12809

charliermarsh commented Aug 11, 2024

InSyncWithFoo commented Aug 11, 2024 •

edited

Loading

charliermarsh commented Aug 11, 2024

codspeed-hq bot commented Aug 11, 2024 •

edited

Loading

InSyncWithFoo commented Aug 11, 2024 •

edited

Loading

github-actions bot commented Aug 11, 2024 •

edited

Loading

charliermarsh commented Aug 11, 2024

InSyncWithFoo commented Aug 11, 2024 •

edited

Loading

InSyncWithFoo commented Aug 12, 2024 •

edited

Loading

dhruvmanila Aug 12, 2024

Avoid parsing joint rule codes as distinct codes in # noqa #12809

Avoid parsing joint rule codes as distinct codes in # noqa #12809

Conversation

charliermarsh commented Aug 11, 2024

Summary

InSyncWithFoo commented Aug 11, 2024 • edited Loading

charliermarsh commented Aug 11, 2024

codspeed-hq bot commented Aug 11, 2024 • edited Loading

CodSpeed Performance Report

Merging #12809 will not alter performance

Summary

InSyncWithFoo commented Aug 11, 2024 • edited Loading

github-actions bot commented Aug 11, 2024 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

charliermarsh commented Aug 11, 2024

InSyncWithFoo commented Aug 11, 2024 • edited Loading

InSyncWithFoo commented Aug 12, 2024 • edited Loading

dhruvmanila Aug 12, 2024

Choose a reason for hiding this comment

Avoid parsing joint rule codes as distinct codes in `# noqa` #12809

Avoid parsing joint rule codes as distinct codes in `# noqa` #12809

InSyncWithFoo commented Aug 11, 2024 •

edited

Loading

codspeed-hq bot commented Aug 11, 2024 •

edited

Loading

InSyncWithFoo commented Aug 11, 2024 •

edited

Loading

github-actions bot commented Aug 11, 2024 •

edited

Loading

`ruff-ecosystem` results

InSyncWithFoo commented Aug 11, 2024 •

edited

Loading

InSyncWithFoo commented Aug 12, 2024 •

edited

Loading